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Abstract 

The World Wide Web has evolved from a distributed hypertext system to a platform-independent graphical user 
interface that integrates many network services. So far, its technology has restricted it mainly to applications for 
information retrieval. 

As networks become ubiquitious and more and more users have a permanent connection, there is an increasing demand 
for other network services, such as real-time data feeds, group communication, and teleconferencing. So far, these services 
have been provided by various proprietary' software systems, which were hard to set up and use, and thus not very 
successful. 

Integrating real-time group communication services into the World Wide Web is a natural way to make them more 
accessible and will take the Web a step further on its way to becoming the universal network application. 

In this paper, we describe functionalities required for these services and present an implementation based on Sun 
Microsystem's Java 2 programming language. We focus on the high-level functionalities and abstractions, but also describe 
an object-oriented programming model for group communication systems. ® 1997 Published by Elsevier Science B.V. 

Keywords: Integration of real-time channels: Innovative applications; Programmatic and specialized user interfaces 



1. Overview 

Most people agree thai communicating is the most 
important activity in today's information society. We 
spend a very significant portion of our lime — work 
and leisure — communicating in groups with other 
people, either directly (face to face), or remotely (via 
telephone, email, or videoconferences). Therefore. 
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finding more efficient means for communication be- 
tween people is a very important area of research. 
We believe that the most significant problem that 
keeps people from communicating with one another 
via teleconferencing software systems is their limited 
accessibility. 

The World Wide Web has become the most conve- 
nient way to access information in the Internet. The 
main reason for this is that WWW browsers integrate 
different network services into a common, easily 
accessible, platform-independent user interface. 

Developing group communication services that 
support conferencing from within the WWW is 
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an excellent way to improve accessibility of such 
services. This should be done in a way that does 
not require specific proprietary features of WWW 
browsers; it should be built on general mechanisms 
provided by all browsers. 

So far, the WWW does not support most of the 
features required for these kinds of applications. 
For example, there is no mechanism for server-ini- 
tiated activity. HTTP is not intended for real-time 
data feeds — it is state- and connectionless. Using 
CGI scripts and forms, the user interacts with the 
Web server rather than the browser. Active contents, 
such as Java Applets, solve these problems. They 
are a very general mechanism supported by most 
browsers, yet they are powerful enough to develop 
serious network-aware applications. 

In this paper, we will describe some general de- 
sign issues for group communication systems for 
the WWW and present an implementation of such a 
system. The paper is organized as follows. Section 
2 provides a short introduction to group communi- 
cation. In Section 3, we discuss some design issues 
for the system. We present our proof-of-concept im- 
plementation, called Promondia, in Section 4 and 
discuss related work in Section 5. Finally, Section 6 
summarizes our results. 

2. Group communication 

Group communication is the exchange of infor- 
mation between a group of participants in a session -\ 
Participants may have different roles in a session. In- 
formation is exchanged using certain media, chosen 
according to availability and suitability for the infor- 
mation to be transferred. 

Different types of group communication sessions 
have evolved in the past in order to reach common 
goals as efficiently as possible. These goals include 
entertainment, decision making, learning, and plain 
information exchange about a common interest. 

For example, a common pattern is a panel dis- 
cussion involving a host, several speakers, and an 
audience (see Fig. 1). Typically, all these participants 
are in the same room. Host and speakers communi- 
cate to the audience and among one another using 
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Fig. I. Session of a panel discussion. 



audio and gestures. The audience provides feed- 
back using facial expressions and gestures. The host 
may let individuals ask questions. The goal is to 
communicate information the audience is interested 
in. 

This type of group communication is based 
mostly on the availability of media, not on their 
suitability. New technology provides the opportunity 
to choose more appropriate media to reach a given 
goal more efficiently. Using this new technology in 
the example above could mean that the audience, 
distributed all over the world, can provide detailed 
feedback without disrupting the speakers. This feed- 
back can be summarized automatically, the host and 
the speakers can see immediate statistics and react 
accordingly. Anonymous questions can be submitted 
to the host, who could choose to forward them or 
discard them. 

Group communication can roughly be divided 
into two categories, synchronous and asynchronous: 
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Asynchronous communication can be viewed as 
synchronous communication with one virtual partic- 
ipant (like a VCR recording a TV show) that stores 
information and provides them when requested by 
another participant. Thus, synchronous communica- 
tion is the more general concept of the two. 

Asynchronous communication can easily be inte- 
grated into the WWW using HTML forms or Java 
from ends and CGI scripts or database servers. Many 
products and services have been developed for this. 

Uni -directional real-time data feeds can also be 
seen as a special case of group communication with 
one or more participants, which are simply data 
sources, and a large number of passive recipients. 

In this paper, we locus on real-time, synchronous 
group communication and on how it can be inte- 
grated into the World Wide Web. 



3. Design aspects for group communication in the 
WWW 

Most systems for group communication in use 
today do not clearly separate all of the four com- 
ponents that make up their characteristics: network 
topology, network services, programming models for 
and implementations of session types. 

There should be a clear distinction between these 
components. Wc will discuss the first three in this 
section and the last one in Section 4. 

3.1. Client/sender approach 

Certainly, it is helpful to make a clear distinction 
between a server subsystem and clients when de- 
signing an architecture for group collaboration in the 
WWW. The server subsystem provides functionality 
common to all session types, such as logging, access 
control, and directory services. For each session, it 
also contains a session-server object that coordinates 
the communication between the clients and deter- 
mines the contents distributed in that session. This 
session-server object may be generic or session-type 
specific. 

A client runs on the participant's system and pro- 
vides the front end that presents the session to the 
participant. Different types of clients reflect the dif- 
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Fig. 2. Servers with different session-server objects. 



ferent roles for each session type: when a participant 
joins, the appropriate client is instantiated. 

For sessions with a large number of participants 
(> 1000). a scalable architecture is required. This can 
be achieved using a distributed server subsystem. 
A network of servers share a common name space 
and host server proxies on behalf of their clients. 
These proxies are also session-server objects. They 
may be generic and simply forward information to 
and from a central main server, or they may contain 
semantics for the type of session they serve (e.g., 
they summarize the local results of a voting session). 

Since the the session -server objects may be loaded 
via the net and instantiated dynamically, a security 
model is required that restricts their access to local 
resources. Therefore, Java's coarse-grained security 
model is not suitable. The more fine-grained security 
model described in [21 may be appropriate. 

3.2. Underlying network sendees 

The system should be based on a reliable network 
service to set up the software components and to 
exchange session and user management data. How- 
ever, for the actual information to be communicated, 
it should be possible to use other network services 
based on the requirements of the particular type of 
communication. In this case, the system network is 
used to exchange the references required to use the 
other network services. 

Network services that could be used include TCP. 
UDP. IP Multicast, and more specialized protocols 
such as RTP |3| or TIBCO Inc.'s Rendezvous in- 
formation bus 14]. A very early version of our 
system [5] used IRC [1| as the messaging back- 
bone. CORBA Events services 16] could be used 
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for strongly typed, language-independent event mul- 
ticasting. 

3.3. Programming models for implementing session 
types 

Many programming models for real-lime collab- 
oration have been suggested. The simplest and most 
basic mechanism is message passing between the 
objects associated with the participants. A conve- 
nient API can offer asynchronous multicast for re- 
mote method invocation on all or a subset of partici- 
pants [7]. This model is very suitable and intuitive to 
use for volatile communication-oriented tasks, such 
as chatting. It can be refined by distinguishing cer- 
tain characteristics for the messages (e.g.. unreliable 
delivery or unspecified delivery order). 

To identify different types of messages, it has 
proven practical to allow for more than one named 
communication channel per session [8]. Clients in- 
terested in a particular type of message can subscribe 
to its respective channel. That way, the different 
clients reflecting the different roles only receive the 
information they need. 

To allow for a reply to the messages sent, delayed 
return values for asynchronous method invocation 
can be implemented by creating a handle to represent 
the reply expected in the future. This reference can 
then be used to wait for the reply or to forward it 
somewhere. An extension of Java providing this is £, 
described in [9J. 

Other, more sophisticated models can be built on 
top of this simple infrastructure. 

Model-View-Controller-based (MVC) ap- 
proaches with a replicated model, one view and 
one controller per user are very suitable for collab- 
oratively creating or modifying data. However, they 
tend to be inefficient if all changes to the model 
are immediately sent to all views. However. Gra- 
ham et al. report on significant performance advan- 
tages using optimized network communication [10]. 
If a more sophisticated strategy is used to transmit 
changes only when needed, application specific se- 
mantics have to be considered. This can be quite 
difficult to implement. 

Gutfreund ct al. suggest a Linda-like simplified 
tuple space model for coordinating distributed appli- 
cations (1 1.12]. This is a very lightweight approach 



to the MVC pattern that provides a generic server and 
requires all semantics in the client objects. This can 
also be used for simple message passing, since clients 
may register for changes on tuples. Lotus g has a sim- 
ilar approach, using "things" as state-keeping objects 
that also provide notification services [13]. 

For synchronisation and floor control, token- 
based mechanisms can be used to ensure data 
integrity [14]. This is an easy way to use collabora- 
tion-unaware programs collaboratively, but severely 
restricts interaction between participants. 

Finally, real-time multimedia streaming requires 
sophisticated network protocols with different qual- 
ity of service characteristics. It should be possible to 
integrate these services and have them controlled by 
the application. 

All these different approaches are very suitable 
for certain tasks, whereas they are inconvenient or 
inefficient for others. We believe that these models 
should complement each other rather than compete. 
A framework for group communication should not 
be bound to a particular model, but provide different 
models for the developer to choose from. 

To allow for software reusability, it should be 
possible to combine different session types to form 
a new type. This can be done recursively to cre- 
ate a hierarchy of components and to dynamically 
add new ways of communication to a session when 
required. When a client joins a session containing 
a subsession, it automatically joins the subsessions 
available in that session, too. 

3.4. Integration into the Web 

For integrating group communication into the Web. 
a session should run within a Web page using standard 
Web browsers. That way, they can be combined with 
HTML data containing general information about the 
session or its topic. Also, sessions can be embedded 
into WWW pages about a particular subject to let peo- 
ple, interested in this subject, communicate with each 
other and create a virtual community. 

Standard WWW mechanisms can be used for 
directory services and bookmarking. Sessions can be 
embedded into MIME email messages so that they 
are automatically started when the recipient reads 
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the email message. This is a eonvenient way to 
send invitations for sessions. Within a session, Web 
pages can be shown to all participants to point to 
information relevant for the current topic. 



4. Promondia: a proof-of-concept 
implementation 

Wc have implemented a system, called Promon- 
dia, providing group communication functionality 
for the WWW. An early version of our system is 
known under the name COMO. 

Promondia consists of a server program and ses- 
sion starters implemented as Java applets, which are 
embedded in HTML documents. These applets are 
called session-management applets. 

When the user views an HTML document that 
contains a reference to a session-management applet, 
the browser loads this applet via HTTP and starts it. 
The applet does not necessarily have to be located 
on the same WWW server as the HTML page con- 
taining it. However, access via certain WWW servers 
can be restricted. 

When the applet is started, it connects to the 
Promondia server and requests to join the specified 
session. If that session does not exist, a new session 
is created on the server by instantiating the specified 
session-server object. When a session is joined, the 
session-starter applet instantiates the session client 
reflecting the participant's role in the session. 

4.1. Session server 

The Promondia server is a stand-alone Java ap- 
plication. It is entirely written in Java and does not 
require any native code. Thus, it runs on any platform 
for which a Java interpreter is available. 

The server includes an HTTP server. This is very 
convenient for the following reason: Current Web 
browsers allow applets to connect only to the host 
they were loaded from. Otherwise, it would be pos- 
sible to bypass a lirewall with a malicious applet. 
Thus, the Java applets must be loaded from the sys- 
tem the Promondia server is running on. This can be 
achieved by using the huilt-in HTTP server. As an 
alternative, a separate standard HTTP server can be 
set up on the same machine. 



A group- management module organizes and co- 
ordinates the group-communication channels. It also 
provides information about active sessions. It sup- 
ports access control on a general, a per-session type, 
and a per-session basis. Configuration can be done 
remotely using special Java applets. 

The multithreaded architecture makes up for per- 
formance weaknesses that are due to the fact that 
the Java byte code has to he interpreted. Without 
just-in-time compilation, a 133 Mhz Pentium PC can 
handle about 100 simultaneous connections. A Sun 
UltraSparc workstation can handle several hundred 
connections at the same lime. Using KafTe [15], a 
freely available runtime system for Java, 350 simul- 
taneous connections have been handled easily on a 
Pentium 120 running Linux. 

4.2. Session management applets 

Two session management applets have been im- 
plemented to embed sessions into an HTML docu- 
ment: 

• A PageStarter simply joins a specified session 
and displays its GUI within a Web page. Visual 
characteristics can be specified in the HTML doc- 
ument. 

• A Float Starter can include advertisements or a 
help menu and lets a user detach the applet from 
the Web page by opening a separate window. 
Thus, the session can be run independently from 
the Web page. 

Other session management modules have been 
implemented as sessions. That way. they can use the 
communication mechanisms provided by the system 
to distribute information about available users and 
session. 

4.3. Implemented session types 

A few session managers and communication 
methods have been implemented to demonstrate the 
system. 

4.3.1. Manager Session type 

A Manager Session provides information about 
which sessions are running and which users are 
available. This is an example of how a session can 
be used for multicast distribution of information. 
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Fig. 3. The Manager-Session diem for administrators. 



It provides two roles: user and administrator. 
A user can select sessions and join them, which 
will download the client code for the session and 
spawn a new window. Invitations can be sent via 
email; the recipient can join the session simply by 
reading the email with a Java-enabled browser. The 
administrator can change the properties and configu- 
ration of the sessions and expel users from a session. 
Fig. 3 shows a screens hot of a Manager-Session 
client. 

43.2. Text-based Chat 

The Chat system offers a variety of chat modules: 
a simple one and several more sophisticated ones, 
which allow for private and anonymous messages, 
notification at certain events, and audio messages. 
Users can direct other participants to Web pages by 
typing a URL. Fig. 4 shows a Chat client with an 
ongoing session. A Moderator client lets the admin- 
istrator control what is being said in the session by 
assigning different rights (free speech, moderated, 
muted) on how to take part in the conversation. 

Promondia Chat sessions can be connected to 
IRC channels. The information transferred is mapped 
both ways. If the session is moderated, the IRC 
channel will also be moderated. 



43 3. Shared Whiteboard 

The Sliared Whiteboard is a multi-user vector-ori- 
ented drawing program. Lines, polygons, rectangles, 
ovals, text, and imported images can be added to a 
shared canvas (see Fig. 5). This kind of functionality 
can be used as a subscssion of a conferencing system 
to illustrate ideas or discuss designs. 

43 A. Voting and surveys 

A VoteModerator applet lets an administrator con- 
figure a question and a set of suggested answers. 
The users can choose one of the answers to cast 
their votes using a Voter client. The results can be 
transmitted automatically or after approval by the 
moderator. In Fig. 6, a Voter is shown. The already 
known votes are presented as colored bars behind the 
suggested answers. 

43.5. Games 

A simple board game 10 for 2 to 10 players has 
also been implemented to illustrate that multi-player 
games can be seen as a form of group communica- 
tion with verv formal interaction rules. 
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Fig. 4. A Chat client. 



5. Related work 

Most work on Web- based collaboration focuses 
on asynchronous database- and transaction-oriented 
cooperation. An example is the BSCW system 
[16.17] developed as pan of the CoopWWW project 
118|. Current implementations of synchronous com- 
munication for the Web are mostly limited to chats 
or other predefined functionality* 

GroubWeb \ \9\ is a collaborative Web Browser 
that is built on top of GroupKit [7]. This approach 
can even make Web hrowsing a fully collaborative 
activity; however it does not improve accessibility 
of group communication, since proprietary software 
has to be installed. The same applies to RAVE [20]. 
which uses browser plug-ins for presenting real-time 
data feeds. CoWeb |21) solves these problems by 
using Java, but it is not programmable. 

GmCo is a simple, but flexible system based 
on message passing and implemented for the alpha 



release of Java [22,23]. The data objects sent in the 
messages are restricted to certain types. 

Mushroom is a very promising and ambitious 
project that focuses on an intuitive user interface. A 
room metaphor is used to represent a session. Persis- 
tent data can be stored in a room for asynchronous 
cooperation. Users may enter a room and meet for 
synchronous cooperation 1 24,25]. A Java-based im- 
plementation is under way, but not available yet. 

JavaSoft is working on a collaboration API [14] 
that is based on ITU recommendation T.122 [8]. This 
API does not yet cover higher-level functionality for 
session management and configuration. 



6. Conclusion and future work 

We have discussed some of the issues involved in 
designing an architecture for a group communication 
infrastructure for the World Wide Web. We believe 
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that building such an infrastructure can help peo- 
ple to communicate more efficiently, since it would 
make these application more accessible. 

Promondia. our award-winning M implementation 
of such a system is freely available for academic and 
non-profit organizations at http://www4.informatik. 
uni-crlangen.de/promondia/. It is used to provide 
chat services on many public Web sites 12 in the In- 
ternet. It is unique in that it is the only flexible, high 
level infrastructure currently available for building 
synchronous group-communication applications for 
the Web. However, it lacks many of the features 
necessary for a globally usable system. 

There is still a lot of work to be done. In partic- 
ular, we arc currently working on a scalable reliable 
multicast backbone. The programming models that 
have been suggested in the past can be combined 
to let the developer choose the model that is most 
suitable for the task. However we feel that it would 
be helpful to integrate these models in a way that 
allows for component reusability. For that, we are 
currently investigating the usability of a model with 
a hierarchy of light weight communication channels 
and with a very weak distinction between channels 
and sessions. 
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